Accent-specific Mandarin adaptation based on pronunciation modeling technology
نویسندگان
چکیده
An accent adaptation approach using pronunciation variation modeling technology for Mandarin accent was proposed in this paper. As Chinese language is monosyllabic, the syllable pronunciation variation dictionary (SPVD) was built to depict the characteristics of accent. Firstly, the pronunciation modeling technology was utilized to get the context-independent and contextdependent accent-specific syllable confusion matrix according to the acoustic recognition results (pin-yin stream). Then the accentspecific Chinese SPVD was constructed from this confusion matrix. Finally, N-Best acoustic recognition candidates were rescored with the help of SPVD. To curtail the necessary adaptation data size for context-dependent SPVD, we divided the syllable context into several groups. The experiment results show that pronunciation variation modeling technology is an effective method for Mandarin accent adaptation, and the context grouping strategy can reduce the adapting speech data effectively while keep the same satisfactory performance.
منابع مشابه
Accent modeling based on pronunciation dictionary adaptation for large vocabulary Mandarin speech recognition
A method of accent modeling through Pronunciation Dictionary Adaptation (PDA) is presented. We derive the pronunciation variation between canonical speaker groups and accent groups and add an encoding of the differences to a canonical dictionary to create a new, adapted dictionary that reflects the accent characteristics. The pronunciation variation information is then integrated with acoustic ...
متن کاملPronunciation variation modeling for Mandarin with accent
In order to solve the problem of the performance decrease when state-of-art automatic speech recognition (ASR) system facing accent speech, we propose the Pronunciation Variation Model (PVM). Two approaches are proposed to construct the PVM in this paper. 6.38% and 7.78% relative error rate reduction is achieved for Shanghai and Wuhan accent mandarin, respectively. The experiment on these two t...
متن کاملAccent Issues in Large Vocabulary Continuous Speech Recognition
Speech recognition has achieved great improvements recently. However, robustness is still one of the big problems, e.g. performance of recognition fluctuates sharply depending on the speaker, especially when the speaker has strong accent that is not covered in the training corpus. In this report, we first introduce our result on cross accent experiments and show a 30% error rate increase when a...
متن کاملAcoustic and Lexical Modeling Techniques for Accented Speech Recognition
Speech interfaces are becoming pervasive among the common public with the prevalence of smart phones and cloud-based computing. This pushes Automatic Speech Recognition (ASR) systems to handle wide range of environments including different channels, noise conditions and speakers with varying accents. This thesis focuses on the impact of speakers’ accents on the ASR models and techniques to make...
متن کاملAccent detection and speech recognition for Shanghai-accented Mandarin
As speech recognition systems are used in ever more applications, it is crucial for the systems to be able to deal with accented speakers. Various techniques, such as acoustic model adaptation and pronunciation adaptation, have been reported to improve the recognition of non-native or accented speech. In this paper, we propose a new approach that combines accent detection, accent discriminative...
متن کامل